A Multi-instance Multi-label Dual Learning Approach for Video Captioning
نویسندگان
چکیده
Video captioning is a challenging task in the field of multimedia processing, which aims to generate informative natural language descriptions/captions describe video contents. Previous approaches mainly focused on capturing visual information videos using an encoder-decoder structure captions. Recently, new encoder-decoder-reconstructor was proposed for captioning, captured both and Based this, this article proposes novel multi-instance multi-label dual learning approach (MIMLDL) captions based structure. Specifically, MIMLDL contains two modules: caption generation reconstruction modules. The module utilizes lexical fully convolutional neural network (Lexical FCN) with weakly supervised mechanism learn translatable mapping between regions labels Then synthesizes sequences reproduce raw outputs module. A fine-tunes modules according gap reproduced videos. Thus, our can minimize semantic generated by minimizing differences sequences. Experimental results benchmark dataset demonstrate that improve accuracy captioning.
منابع مشابه
Multi-instance multi-label learning
In this paper, we propose the MIML (Multi-Instance Multi-Label learning) framework where an example is described by multiple instances and associated with multiple class labels. Compared to traditional learning frameworks, the MIML framework is more convenient and natural for representing complicated objects which have multiple semantic meanings. To learn from MIML examples, we propose the Miml...
متن کاملMulti-instance multi-label active learning
Multi-instance multi-label learning (MIML) has achieved success in various applications, especially those involving complicated learning objects. Along with the enhancing of expressive power, the cost of annotating a MIML example also increases significantly. In this paper, we propose a novel active learning approach to reduce the labeling cost of MIML. The approach actively query the most valu...
متن کاملLearnability of Multi - Instance Multi - Label Learning
Multi-Instance Multi-Label learning (MIML) is a new machine learning framework where one data object is described by multiple instances and associated with multiple class labels. During the past few years, many MIML algorithms have been developed and many applications have been described. However, there lacks theoretical exploration to the learnability of MIML. In this paper, through proving a ...
متن کاملFast Multi-Instance Multi-Label Learning
In multi-instance multi-label learning (MIML), one object is represented by multiple instances and simultaneously associated with multiple labels. Existing MIML approaches have been found useful in many applications; however, most of them can only handle moderatesized data. To efficiently handle large data sets, we propose the MIMLfast approach, which first constructs a low-dimensional subspace...
متن کاملA Framework of Hashing for Multi-instance Multi-label Learning
Multi-instance multi-label learning (Miml) is a powerful framework, which deals with the problem that each example is represented as multiple instances and associated with multiple class labels. Previous works mostly focus on accuracy, while scalability for large scale datasets has been rarely addressed. In this paper, we present a novel framework – Multi-instance Multi-label Hashing (MimlH) to...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: ACM Transactions on Multimedia Computing, Communications, and Applications
سال: 2021
ISSN: ['1551-6857', '1551-6865']
DOI: https://doi.org/10.1145/3446792